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the IBM S/390 Parallel Enterprise Server - Generation 3. After an introduction to the 
concepts of static timing analysis, we describe the timing-modeling for the gates and 
interconnects, explain the optimization schemes and present obtained results. 
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Variability of circuit performance is becoming a very important issue for ultra-deep sub- 
micron technology. Gate length variation has the most direct impact on circuit performance. 
Since many factors tontribute to the variability of gate length, recent studies have modeled 
the variability using Gaussian distributions. In reality, the through-pitch and through-focus 
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A IGOPS dynamically reconfigurable processing unit with embedded Flash memory and 
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and FPGA bitstreams are stored in the embedded Flash memory and are independently 
accessible through 3 content-specific, 64-bit I/O ports with a peak read rate of 1.2GB/S. The 
system is implemented in a b.48um, 2PL-6ML CMOS Flash technology, chip area is 70mm2. 
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This paper discusses challenges the designer faces in integrating entire system product 
designs, containing tens or even hundreds of millions of logic gates, Into single chip 
solutions now within reach using circuit densities possible in the latest silicon technologies. 
Managing designs of this size presents a new dimension of issues, and managing the 
physical and electrical effects of these high density device geometries presents another; 
solutions in both these^areas are presented. Lastly, thi ... 



Keywords: design productivity, methodology, power management, signal integrity, system- 
on-chip, time to market 



10 Highlights of ISSCC and the ciesign of state-of-the-art microprocessors: A 1 .5GHz third Q 
g eneration itaniunn® 2 processor 
^ Jason Stinson, Stefan Rusu 

V June 2003 Proceedings of the 40th conference on Design automation 
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This 130nm Itanium® 2 processor implements the Explicitly Parallel Instruction Computing 
(EPIC) architecture and features an bn-dle 6MB, 24-waY set associative L3 cache. The 
374mm2 die contains 410M transistors and is implemented in a dual-Vt process with 6 
layers copper interconnects and FSG dielectric. The processor runs at 1.5GHz at 1.3V and 
dissipates a maximumibf 130W. This paper reviews circuit design and package details, 
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independent, port based, supports designs with level triggered latches, and can be input into 
commercial STA (Static Timing Analysis) tools. The model is based on an extension of the 
concept of latch transparency to circuit block transparency introduced in this paper. It was 
implemented, tested and is being used in conjunction with transistor level STA for 
microprocessor designs with tens of millions of ... 
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This paper describes the implementation of a novel path- based learning methodology that 
can be applied for two purposes: (1) In a pre-sllicon simulation environment, path-based 
learning can be used to produce a fast and approximate simulator for statistical timing 
simulation. (2) In post-silicon phase, path-based learning can be used as a vehicle to derive 
critical paths based on the pass/fall behavior observed from the test chips. Our path-based 
learning methodology consists of four major compo ... 
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As processing technology scales down to the nanometer regime, capacitive crosstalk is 
having an increasingly adverse effect on circuit functionality, leading to increasing number 
of chip failures. In this paper, we propose mapping the static crosstall< functional noise 
problem into the well understood static timing problem. The key differences between static 
noise and static timing analyses, namely the injection of noise, accurate noise window 
propagation and register sensitive window computation ... 

Keywords: crosstalk, noise, signal integrity 



14 Design for manufacturability and global routing: A cost-driven lithographic correction Q 
methodolo g y based on off-the-shelfsizing tools 
P. Gupta, A. B. Kahng, D. Sylvester, J, Yang ^ 
June 2003 Proceedings of the 40th conference on Design automation 

Full text available* 1p!lpdf(149 73 KB) Additional Information: full citation , abstract , references , citings , index 

As minimum feature sizes continue to shrink, patterned features have become significantly 
smaller than the wavelength of ligKt used In optical lithography. As a result, the requirement 
for dimensional variation control, especially In critical dimension (CD) 3a, has become more 
stringent. To meet these requirements, resolution enhancement techniques (RET) such as 
optical proximity correction (OPC) and phase shift mask (PSM) technology are applied. 
These approaches result \rva substantial inc ... 
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^5 Issues in crosstalk: Efficient switching window computation for cross-talk noise Q 
^Bhavana Thudi, David Blaauw 

December 2002 Proceedings of the 8th ACM/IEEE international workshop on Timing 
issues in the specification and synthesis of digital systems 

Full text available- 133 pdf(1 82 45 KB) Additional Information: full citation , abstract , references , citings , index 
. : ^ terms 

In this paper, we present an efficient method for computing switching windows in the 
presence of delay noise. In static tiniing analysis, delay noise has traditionally been modeled 
using a simple switch-factor based/ndise model and the computation of switching windows is 
performed using an iterative algorithm where timing window propagation and switch factor 
updates are computed repeatedly until convergence. It was shown that the worst-case 
number of iterations required for convergence is 0(n< ... 

16 New directions in timing analysis: From blind certainty to informed uncertainty Q 
Kurt Keutzer, Michael Orshansky 

December 2002 Proceedings of the 8th ACM/IEEE international workshop on Timing 
issues in the specification and synthesis of digital systems 

Full text available: ^ pdf(243.42 KB) Additional Information: full citation, abstract, references , ciflngs, index 

V terms 

The accuracy, computational efficiency, and reliability of static timing analysis have made it 
the workhorse for verifying the timing of synchronous digital integrated circuits for more 
than a decade. In this paper we charge that tfiextraditional deterministic approach to 
analyzing the timing of circuits Is significantly undermining Its accuracy and may even 
challenge its reliability. We argue that computation of the static timing of a circuit requires a 
dramatic rethinking In order to continue s ... 

17 Delay and noise modeling in the nanometer regime: Non-iterative switching window Q 
computation for delay-noise 

Bhavana Thudi, David Blaauw 

June 2003 Proceedings of the 40th conference on Design automation 

Full text available: ^ pdfd 38.38 KB) Additional Information: full citation , abstract , references , index terms 

In this paper, we present an efficient method for computing switching windows in the 
presence of delay noise. In static timing analysis, delay noise has traditionally been modeled 
using a simple switch-factor based noise model and the computation of switching windows is 
performed using an iterative algorithm, resulting In an overall run time of 0(n2), where n is 
the number of gates in the circuit. It has also been shown that the Iterations converge to 
different solutions, depending on the initia ... 

Keywords: cross-talk noise, superposition, switching window 



1^ Delay estimation VLSI circuits from a high-level view 
Mahadevamurty Neman!, Farid N. Najm 

May 1998 Proceedings of the 35th annual conference on Design automation - Volume 
00 

Full text available: ■ apdf(180.14 KB ) 

S Additional Information: full citation , abstract , references , index terms 

^ Publisher Site 

Estimation of the delay of a Boolean function from its functional description is an important 
step towards design exploration at the register transfer level (RTL). This paper addresses 
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the problem of estimating the delay of certain optimal multi-level implementations of 
combinational circuits, given only their functional description. The proposed delay model 
uses a new complexity measure called the delay measure to estimate the delay. It has an 
advantage th ... 

19 Functional timing analysis for IP characterization 

Hakan Yalcin, Mohammad Mortazavi, Robert Palermo, Cyrus Bamji, Karem Sakallah 
June 1999 Proceedings of the 36th ACM/IEEE conference on Design automation 

Full text available: ^pdf (647.16 KB) Additional Information: full citation , references , citings, index terms 



Keywords: IP characterization, false path, functional (mode) dependency, timing analysis 



20 The usage of stochastic processes in embedded system specifications 
Axel Jantsch, Ingo Sander, Wenbiao Wu 

April 2001 Proceedings of the ninth international symposium on Hardware/software 
codesign 

.^^^m^mru^ Ad*naMn,or.a«o„:M|*n.«^,<— 

We review the use of nondeterminism and identify two different purposes. The descriptive 
purpose handles uncertainties in the behaviour of existing entities. The constraining purpose 
is used in specifications to constrain implementations. For the specification of embedded 
systems we suggest a stoctiastic processor instead of nondeterminism. It serves mostly the 
descriptive purpose but can also be used to constrain the system. We carefully distinguish 
different interpretati ... 
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Crosstalk Noise: On convergence of switching windows computation in presence of Q 
crosstalk noise 

Pinhong Chen, Yuji Kukimoto, Chin-Chi Teng, Kurt Keutzer 

April 2002 Proceedings of the 2002 international symposium on Pliysicai design 

Full text available: pdf(1 14.76 KB) Additional Information: full citation , abstract , references , citings , index 
* terms 

Detecting overlapping of switching windows between coupling nets is a major static 
technique to accurately locate crosstalk noise. However, due to the mutual dependency 
between switching windows, the computation requires iterations to converge. In this paper, 
we discuss the issues and provide answers to the important questions involved in 
convergence and numerical properties, including the effect of coupling models, multiple 
convergence points, convergence rate, computational complexity, non-mon ... 

22 Functional correlation analysis in crosstalk induced critical paths identification Q 
Tong Xiao, Malgorzata Marek-Sadowska 

June 2001 Proceedings of the 38th conference on Design automation 

Additional Information: full citation , abstract , references , citings , index 
terms 



Full text available: ^pdf(60.06 KB) 



In deep submicron digital circuits capacitive couplings make delay of a switching signal 
highly dependent on its neighbors switching times and switching directions. A long path may 
have a large num-ber of coupling neighbors with difficult to determine interdepen-dencies. 
Ignoring the mutual relationship among the signals may result in a very pessimistic 
estimation of circuit delay. In this paper, we apply efficient functional correlation analysis 
techniques to identify critical paths caused ... 

23 False path exclusion in delay analysis of RTL-based datapath-controller designs 
C. Papachrlstou, M. IMourani 

September 1996 Proceedings of the conference on European design automation 

Full text available: ^ pdf(334.54 KB) Additional Information: full citation , references , index terms 



Timing analysis with crosstalk as fixpoints on complete lattice Q 
Hal Zhou, Narendra Shenoy, William Nicholls 

June 2001 Proceedings of the 38th conference on Design automation 
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Increasing delay variation due to crosstalk has a dramatic impact on deep sub-micron 
technologies. It is now necessary to include crosstalk in timing analysis. But timing analysis 
with crosstalk is a chicken-and-egg problem since crosstalk effect in turn depends on timing 
behavior of a circuit. In this paper, we establish a theoretical foundation for timing analysis 
with crosstalk. We show that solutions to the problem are fixpoints on a complete lattice. 
Base on that, we prove in general t ... 



25 A new gate delay model for simultaneous switching and its applications 
Liang-Chi Chen, Sandeep K. Gupta, Melvin A. Breuer 
June 2001 Proceedings of the 38th conference on Design automation 

Additional Information: full citation , abstract , references , citings, index 
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^® Timing verification of sequential domino circuits 

David Van Cannpenhout, Trevor Mudge, Karem A. Sakallah 

January 1997 Proceedings of the 1996 IEEE/ACM international conference on 
Computer-aided design 

Full text available: ^ rpjl 

paT(«t3.^b Kbj,^ Additional Information: full citation , abstract , references , index terms 
Publisher Site 

Two methods are presented for static timing verification of sequential circuits implemented 
as a mix of static and domino logic. Constraints for proper operation of domino gates are 
derived. An important observation is that input signals to domino gates may start changing 
near the end of the evaluate phase. The first method models domino gates explicitly, similar 
to latches. The second method treats domino gates only during pre- and post-processing 
steps. This method is shown to be more conserva ... 

Keywords: domino gates, input signals, logic testing, sequential domino circuits, static 
timing verification 



27 Session 3A: Sequential synthesis: Placement driven retiming with a coupled edge 
timin g model 

Ingmar Neumann, Wolfgang Kunz 

November 2001 Proceedings of the 2001 IEEE/ACM international conference on 
Computer-aided design 

Full text available: f ?! pdfd 36.00 KB) Additional Information: full citation , abstract, references , dtiogs. index 
^ terms 

Retiming is a widely investigated technique for performance optimization. It performs 
powerful modifications on a circuit netlist. However, often it is not clear, whether the 
predicted performance improvement will still be valid after placement has been performed. 
This paper presents a new retiming algorithm using a highly accurate timing model taking 
into account the effect of retiming on capacitive loads of single wires as well as fanout 
systems. We propose the Integration of retiming into at... 

^® Clock Scheduling and Clocktree Construction for Hi g h Performance ASICS 
Stephen Held, Bernhard Korte, Jens MaBberg, Matthias Ringe, Jens Vygen 
November 2003 Proceedings of the 2003 lEEE/ACi^ international conference on 
Computer-aided design 

Full text available: ^ pdf(384.60 KB) Additional Information: full citation , abstract . Index terms 
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In this paper we present a new method for clock schedulingand clocktree construction that 
improves the performance ofhigh-end ASICs significantly. First, we compute a clock schedule 
that yields the optimumcycle time and the best possible clock distribution with respectto 
early and late mode; in particular the number of criticaltests is minimized. Second, 
individual arrival time intervalsare computed for all endpoints of the clocktree. Finally, 
weconstruct a clocktree that realizes arrival times w ... 



29 (S pecial session) embedded tutorial: DFM in Nm-process generation: Toward 
stochastic design for digital circuits: statistical static tinning analysis 
Shuji Tsukiyama 

January 2004 Proceedings of the 2004 conference on Asia South Pacific design 
automation: electronic design and solution fair 2004 

Full text available: ^ pdf(1 97.96 KB) Additional Information: full citation , abstract , references 

Due to the process variations and the variations of environmental factors such as supply 
voltage and temperature, the circuit parameters and hence the circuit performance such as 
delay fluctuate, and their variability and uncertainty are increasing in the deep sub-micron 
technology. Therefore, producing high performance digital circuits in high yield becomes 
difficult more and more. Various efforts have been done in order to analyze and reduce such 
fluctuations. Among them, statistical static ti ... 



30 Mana ging power and performan ce for S ystem-on-Chip designs using Voltage Islands Q 
David E. Lackey, Paul S. Zuchowski, Thomas R. Bednar, Douglas W. Stout, Scott W. Gould, 
John M. Cohn 

November 2002 Proceedings of the 2002 IEEE/ACM international conference on 
Computer-aided design 

Full text available: tgl Ddf(96.51 KB) Additional Information: full citation , abstract, references , dtiogs, index 
^ terms 

This paper discusses Voltage Islands, a systenn architecture and chip implementation 
methodology, that can be used to dramatically reduce active and static power consumption 
for System-on-Chip (SoC) designs. As technology scales for increased circuit density and 
performance, the need to reduce power consumption increases in significance as designers 
strive to utilize the advancing silicon capabilities. The consumer product market further 
drives the need to minimize chip power consumption. Effectiv ... 



Statistical timing analysis: First-order incremental block-based statistical tinnin g 
analysis 

C. Visweswariah, K. Ravindran, K. Kalafala, S. G. Walker, S. Narayan 

June 2004 Proceedings of the 41st annual conference on Design automation 

Full text available: ^ pclf(215.65 KB) Additional Information: full citation , abstract , references , index terms 

Variability in digital integrated circuits makes tinning verification an extremely challenging 
task. In this paper, a canonical first order delay model is proposed that takes into account 
both correlated and independent randomness. A novel linear-time block-based statistical 
timing algorithm is employed to propagate timing quantities like arrival times and required 
arrival times through the timing graph in this canonical form. At the end of the statistical 
timing, the sensitivities of all timing ... 



Keywords: incremental, statistical timing, variability 



32 Issues in timing analysis: Worst-case circuit delay taking into account power supply 
variations 

Dionysios Kouroussis, Rubil Ahmadi, Farid N. Najm 

June 2004 Proceedings of the 41st annual conference on Design automation 
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Current Static Timing Analysis (STA) techniques allow one to verify the timing of a circuit at 
different process corners which only consider cases where all the supplies are low or high. 
This analysis may not give the true maximum delay of a circuit because it neglects the 
possible mismatch between drivers and loads. We propose a new approach for timing 
analysis in which we first identify the critical path(s) of a circuit using a power-supply-aware 
timing model. Given these critical paths, we th ... 

Keywords: power grid, static timing analysis, voltage fluctuations 



33 Timing verification on a 1 .2M-device full-custom CMOS design 
Jengwei Pan, Larry Biro, Joel Grodstein, Bill Grundmann, Yao-Tsung Yen 
June 1991 Proceedings of the 28th conference on ACM/IEEE design automation 

Full text available: ^ pdf(333.48 KB) Additional Information: full citation , references , index terms 



34 Development of ASIC Chip-Set for High-End Network Processing Application-A Case Q 
Study 

Sanjeev Patel 

January 2002 Proceedings of the 2002 conference on Asia South Pacific design 
automation/VLSI Design 

Ddfd 69.08 KB) 

^Additional Information: full citation , abstract 
Publisher Site 

Choosing the right methodology is a significant step towards successful VLSI designs. 
Traditional methodologies and tools are no longer adequate to handle large and complex 
designs. This paper presents a novel design methodology for complex deep-submicron 
designs, using a case study of the development of a high-end network processing ASIC chip- 
set. The paper focuses on the synergetic use of the "dual design verification approach", 
along with static verification methods in achieving defect free s ... 



35 Session 8A: static timing analysis: Transistor-level timing analysis using ennbedded 
simulation 

Pawan Kulshreshtha, Robert Palermo, Mohammad Mortazavi, Cyrus Bamji, Hakan Yalcin 
November 2000 Proceedings of the 2000 IEEE/ACM international conference on 
Computer-aided design 

Full text available: ^ pdf(86.18 KB) Additional Information: full citation , abstract , references 

A high accuracy system for transistor-level static timing analysis is presented. Accurate 
static timing verification requires that individual gate and interconnect delays be accurately 
calculated. At the sub-micron level, calculating gate and interconnect delays using delay 
models can result in reduced accuracy. Instead, the proposed method calculates delays 
through numerical integration using an embedded circuit simulator. It takes into account 
short circuit current and carefully chooses the se ... 



36 Provablv correct high-level tinninq analysis without path sensitization 
Subhrajit Bhattacharya, Sujit Dey, Franc Brglez 

November 1994 Proceedings of the 1994 lEEE/ACi^ international conference on 
Computer-aided design 

Full text available: ■ p|pdf(820.65 KB) Additional Information: full citation , abstract, references , dtings. index 

terms 

This paper addresses the problenn of true delay estimation during high level design. The 
existing delay estimation techniques either estimate the topological delay of the circuit 
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which may be pessimistic, or use gate-level timing analysis for calculating the true delay, 
which may be prohibitively expensive. We show that the paths in the implementation of a 
behavioral specification can be partitioned into two sets, SP and UP. While the paths in SP 
can affect the delay of the circuit ... 

37 Timing analysis and verification: SLOCOP-II: a versatile timing verification system for Q 
MOSVLSI 

P. Johannes, P. Das, L Claesen, H. De l^an 

March 1990 Proceedings of the conference on European design automation 

Full text available: ^pdf(419.26 KB) Additional Information: full citation , abstract , references 

The new SLOCOP-II tinning verification system for the accurate performance analysis of 
MOSVLSI circuits is being presented. The algorithms in SLOCOP-II solve the serious problem 
of "false paths" that occur in all existing timing verifiers, by taking into account the logic 
functionallity of the circuits at hand. To allow this for custom MOSVLSI designs, new event 
determination algorithms based on binary decision tree (BDT) have been developed and are 
presented in this paper. The algorithms to avoi ... 

Keywords: Static timing verification 



38 Timing Analysis in Presence of Power Supply and Ground Voltage Variations 
Rubil Ahmadi, Farid N. Najm 

November 2003 Proceedings of the 2003 IEEE/ACM international conference on 
Computer-aided design 

Full text available: 'g pdf(368.16 KB) Additional Information: full citation , abstract , index terms 

Given the sensitivity of circuit delay to supply and ground voltagevalues, static timing 
analysis (STA) must take into account supplyvoltage variations. Existing STA techniques 
allow one to verifythe timing at different process corners which effectively only considers 
cases where all the supplies are low or all are high. Cases of mismatch between the supplies 
of driver and load are not considered. In practice, supply voltages are neither totally 
independentnor totally dependent. In this work, we ... 

39 IEEE 1394a 2000 Physical Layer ASIC 
Ranjit Yashwante, Bhalchandra Jahagirdar 

January 2002 Proceedings of the 2002 conference on Asia South Pacific design 
automation/VLSI Design 

Full text available: "S pdrd 69.35 KB) 

jST ^ Additional Information: full citation , abstract 

W Publislier Site 

CN4011A is IEEE 1394a_2000 standard Compliant Physical Layer ASIC. It is a O.lSum 
mixed-signal ASIC incorporating three analog ports, PLL, reference generator for analog 
along with the digital control logic. Whole ASIC from specification to GDSII including analog 
was designed at Controlnet (I) Pvt. Ltd CN4011A has three 1394a_2000 fully compliant 
ports that support data transfers at 100/200/400 Mbps. It supports repeating of data over 
on all ports other then receiving port. It's Interoperable wi ... 

40 Compiler-directed data prefetching in multiprocessors with memory hierarchies 
Edward H. Gornish, Elana D. Granston, Alexander V. Veidenbaum 

June 1990 ACM SIGARCH Computer Architecture News , Proceedings of the 4th 

international conference on Supercomputing, volume i8 issue 3 
Full text available* ^ pdf(1.53 MB) Additional Information: full citation , abstract , references , citings , index 

terms 

Memory hierarchies are used by multiprocessor systems to reduce large memory access 
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times. It is necessary to automatically manage sucli a hierarchy, to obtain effective memory 
utilization. In this paper, we discuss the various issues involved in obtaining an optimal 
memory management strategy for a memory hierarchy. We present an algorithm for finding 
the earliest point in a program that a block of data can be prefetched. This determination is 
based on the control and data dependencies in t ... 
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Circuit effects in static timing: Osculatin g Thevenin model for preciicting delay and slew 
of capacitively characterized cells 
Bernard N. Sheehan 

June 2002 Proceedings of the 39th conference on Design automation 

Full text available: ^ pdf(76.62 KB) Additional Information: full citation , abstract , references , index terms 

To extrapolate from one point to another using a line, one had better get the slope right. In 
this paper we apply a similar concept to the important problem in Static Timing Analysis 
(STA) of predicting cell timing for RC loads using capacitive characterization data. Instead of 
a line we have a Thevenin circuit, and instead of matching slopes we match load 
sensitivities. We present a table driven, highly accurate cell delay and slew prediction 
procedure that can improve STA when interconnect eff ... 



Keywords: effective capacitance, static timing analysis 



42 A design flow for partially reconfi g urable hardware 
Ian Robertson, James Irvine 

May 2004 ACM Transactions on Embedded Computing Systems (TECS), volume 3 issue 2 
Full text available: ^ pdf(698.30 KB) Additional Information: full citation, abstract , references , index terms 

This paper presents a top-down designer-driven design flow for creating hardware that 
exploits partial run-time reconfiguration. Computer-aided design (CAD) tools are presented, 
which complement conventional FPGA design environments to enable the specification, 
simulation (both functional and timing), synthesis, automatic placement and routing, partial 
configuration generation and control of partially reconfigurable designs. Collectively these 
tools constitute the dynamic circuit switching CAD f ... 

Keywords: FPGA, Viterbi decoder, configuration control, dynamically reconfigurable logic 
(DRL), power estimation, run-time reconfiguration (RTR) 



43 Efficient Generation of Delay Change Curves for Noise-Aware Static Timing Analysis 
Kanak Agarwal, Yu Cao, Takashi Sato, Dennis Sylvester, Chenming Hu 
January 2002 Proceedings of the 2002 conference on Asia South Pacific design 
automation/VLSI Design 
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In this paper, we explore the concept of using analytical models to efficiently generate delay 
change curves (DCCs) that can then be used to characterize the impact of noise on any 
victim/aggressor configuration. Such an approach captures important noise considerations 
such as the possibility of delay change even when the switching windows of neighboring 
gates do not overlap. The technique is model-independent, which we demonstrate by using 
several crosstalk noise models to obtain results. Furth ... 

^ Issues in crosstalk: Clock schedule verification with crosstalk 
Hai Zhou 

December 2002 Proceedings of the 8th ACM/IEEE international workshop on Timing 
issues in the specification and synthesis of digital systems 

Full text available: ^ pdfd 57.54 KB) Additional Information: full citation , abstract , references , index ternns 

Delay variation due to crosstalk has made tinning analysis a hard problem. In sequential 
circuits with transparent latches, crosstalk makes the clock schedule verification even 
harder. In this paper, we point out a false negative problem in current clock schedule 
verification techniques and propose a new approach based on switching windows. In this 
approach, coupling delay calculations are naturally combined with latch iterations. A novel 
algorithm is given for clock schedule verification in the ... 

Keywords: clock schedule, coupling, delay, verification 



45 Global harmony: coupled noise analysis for full-chip RC interconnect networks Q 
K. L Shepard, V. Narayanan, P. C. Elmendorf, Gutuan Zheng 

November 1997 Proceedings of the 1997 IEEE/ACM international conference on 
Computer-aided design 

Full text available: J.pdf( 238.78 KB ) Additional Information: full citation , abstract , references , citings , index 

M Publisher Site l^HDS 

Noise is becoming one of the most important metrics in the design of VLSI systems, 
certainly of comparable importance to area, timing, and power. In this paper, we describe 
Global Harmony, a methodology for the analysis of coupling noise in the global interconnect 
of large VLSI chips, being developed for the design of high-performance microprocessors. 
The architecture of Global Harmony involves a careful combination of static noise analysis, 
static timing analysis, and reduced-order modelling te ... 

Keywords: noise, static timing analysis, interconnect 



A spacing algorithm for performance enhancement and cross-talk reduction 
Kamal Chaudhary, Akira Onozawa, Ernest S. Kuh 

November 1993 Proceedings of the 1993 IEEE/ACM international conference on 
Computer-aided design 

Full text available: ^ pdf(621.94 KB) Additional Information: full citation, references , citings 



A crosstalk-aware timing-driven router for FPGAs 
Steven J. E. Wilton 

February 2001 Proceedings of the 2001 ACM/SIGDA ninth international symposium on 
Field programmable gate arrays 
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As integrated circuits are migrated to more advanced technologies, it has become clear that 
crosstalk is an important physical phenomenon that must be taken into account. Crosstalk 
has primarily been a concern for ASICs, multi-chip modules, and custom chips, however, it 
will soon become a concern in FPGAs. In this paper, we describe the first published 
crosstalk- a ware router that targets FPGAs. We show that, in a representative FPGA 
architecture implemented in a 0.18mm technology, the avera ... 

Keywords: crosstalk, field-programmable gate arrays, routing algorithms 



48 Interface timing verification drives system design 
Ajay J. Daga, Peter R. Suaris 

June 1997 Proceedings of the 34th annual conference on Design automation - Volume 



System design, i.e. the design of board-level circuits andsystems-on-a-chip, focuses on the 
integration of off-the-shelf andapplication-specific VLSI components. A key aspect of 
systemdesign is to ensure the satisfaction of component interface timingrequirements. This 
is necessary for the correct exchange ofinformation among components on a system. We 
present amethodology for the interface timing verification and subsequenttiming-driven 
floorplanning of systems. We present results on theapplicat ... 

49 A new concept for accurate modeling of VLSI interconnections and its a p plication for Q 
timing simulation 

B. Wunder, G. Lehmann, K. Muller-Glaser 

September 1996 Proceedings of the conference on European design automation 

Full text available: ^pdf(415.21 KB) Additional Information: full citation , references , index terms 



Coping with variability: the end of deterministic desi g n: Death, taxes and failing chi ps Q 
Chandu VIsweswariah 

June 2003 Proceedings of the 40th conference on Design automation 



In the way they cope with variability, present-day methodologies are onerous, pessimistic 
and risky, all at the same time! Dealing with variability is an increasingly important aspect 
of high-performance digital integrated circuit design, and indispensable for first-time-right 
hardware and cutting-edge performance. This invited paper discusses the methodology, 
analysis, synthesis and modeling aspects of this problem. These aspects of the problem are 
compared and contrasted in the ASIC and custom ... 
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This paper reports on a method for extending existing VHDL design and verification software 
available for the Xilinx Virtex series of FPGAs. It allows the designer to apply standard 
hardware design and verification tools to the design of dynamically reconfigurable logic 
(DRL). The technique involves the conversion of a dynamic design into multiple static 
designs, suitable for input to standard synthesis and APR tools. For timing and functional 
verification after APR, the sections of the design c ... 
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We present an efficient technique to reduce the switching activity in a technology-mapped 
CMOS combinational circuit based on local logic transformations. The transformations 
consist of adding redundant connections or gates so as to reduce switching activity. We 
describe simple and efficient procedures, based on logic implication, for identifying the 
sources and targets of the redundant connections. Additionally, we give procedures that 
permit the designer to trade-off power and delay after ... 
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An adaptive timing-driven layout system, called JUNE, has been developed. The constructive 
algorithm, which combines placement with the global routing, constructs a placement 
satisfying timing and routability constraints. The placement problem for each macro is 
solved hierarchically as a sequence of two optimization problems followed by an adaptive 
correction procedure. Experimental results for industrial sea-of-gates chips confirmed 
effectiveness of this approach. 
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on global interconnects. In this paper, we present a practical solution for simultaneous 
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retiming and multilevel global placement for performance optimization, based on the theory 
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Datapath design is one of the most critical elements in the design of a high performance 
microprocessor. However datapath design is typically does manually, and is often custom 
style. This adversely impacts the overall productivity of the design team, as well as the 
quality of the design. In spite of this, very little automation has been available to the 
designers of high performance datapaths. In this paper we present a new "macro-driven" 
approach to the design of datapath circ ... 
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Accurate gate-level static timing analysis in the presenceof RC loads has become an 
important problem formodern deep-submicron designs. Non-capacitive loadsare usually 
analyzed using the concept of an effectivecapacitance, Ceff. Most published algorithms for 
Ceff,however, require special cell characterization orsupplemental information that is not 
part of standardtiming libraries. In this paper we present a novel Ceffalgorithm that is 
strictly compatible with existing timinglibraries. It is also ... 
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Logic optimization procedures principally attennpt to optinnize three criteria: performance, 
area and testability. The relationship between area optimization and testability has recently 
been explored. As to the relationship between performance and testability, experience has 
shown that performance optimizations can, and do in practice, introduce single stuck-at- 
fault redundancies into designs. Are these redundancies necessary to increase performance 
or are they only an unnecessary byproduct ... 



59 Power supply, voltage, and frequency management: Dynamic voltage and frequency Q 
scaling based on workload decomposition 
Kihwan Choi, Ramakrishna Soma, Massoud Pedram 

August 2004 Proceedings of the 2004 international symposium on Low power 
electronics and design 

Full text available: ^pdf(416.31 KB) Additional Information: full citation , abstract , references , index terms 

This paper presents a technique called "workload decomposition" in which the CPU workload 
is decomposed in two parts: on-chip and off-chip. The on-chip workload signifies the CPU 
clock cycles that are required to execute instructions in the CPU whereas the off-chip 
workload captures the number of external memory access clock cycles that are required to 
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perform external memory transactions. When combined with a dynamic voltage and 
frequency scaling (DVFS) technique to minimize the energy consumpt ... 
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This paper presents a voltage scaling approach that is based on an enhanced variant of 
c/usterec/ voltage scaling originally proposed by Usami and Horowitz ([1]) The results show 
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speed and quality of results. Data are validated through power and timing analysis 
performed with a commercial tool. 
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